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parameters, wherein the update pad conditioning parameters are 
calculated using a pad wear and conditioning model that predicts 
the wafer material removal rate of the polishing pad based upon 
conditioning parameters, such as the conditioning down force and 
rotational speed of the conditioning disk. 
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Feedforward and Feedback Control for Conditioning 
of Chemical Mechanical Polishing Pad 

Related Applications 

This application claims priority from U.S. application serial number 60/298,878, filed June 19, 
5 2001 and entitled "Advanced Process Control for Semiconductor Manufacturing Process," U.S. 
application serial number 60/305,782, filed July 16, 2001 and entided "Feedforward and Feedback 
Control for Conditioning of Chemical Mechanical Polishing Pad," U.S. application serial number 
60/318,741, filed September 12, 2001 and entitled **Feedforward and Feedback Control for 
Conditioning of Chemical Mechanical Polishing Pad," and U.S. application serial number 09/998,384, 
10 filed November 30, 2001 and entitled "Feedforward and Feedback Control for Conditioning of 
Chemical Mechanical Polishing Pad," all of which are incorporated herein by reference. 

Field of the Invention 

The present invention generally relates to the area of polishing and methods for improving the 
life of polishing pads. 

15 Background of the Invention 

Chemical-mechanical polishing (CMP) is used in semiconductor fabrication processes for 
obtaining full planarization of a semiconductor wafer. The method involves removing noaterial (e.g., a 
sacrificial layer of surface material) from the wafer, (typically silicon dioxide (SiO^) using mechanical 
contact and chemical erosion from (e.g., a moving polishmg pad saturated with slurry). Polishing 

20 flattens out height differences, since areas of high topography (hills) are removed faster than areas of 
low topography (valleys). FIG. 1 A shows a top view of a CMP machine 100 and FIG. IB shows a side 
section view of the CMP machine 100 taken through line AA. The CMP machine 100 is fed wafers to 
be polished. Typically, the CMP machine 100 picks up a wafer 105 with an arm 101 and places it onto 
a rotating polishing pad 102. The polishing pad 102 is made of a resilient material and is often 

25 textured, to aid the polishing process. The polishing pad 102 rotates on a platen 104 or turn table 

located beneath the polishing pad 102 at a predetermined speed. The wafer 105 is held in place on the 
polishing pad 102 by the arm 101. The lower surface of the wafer 105 rests against the polishing pad 
102. The upper surface of the wafer 105 is against the lower surface of the wafer carrier 106 of arm 
101. As the polishing pad 102 rotates, the arm 101 rotates the wafer 105 at a predetermined rate. The 

30 arm 101 forces the wafer 105 into the polishing pad 102 with a predetermined amount of down force. 
The CMP machine 100 also includes a slurry dispense arm 107 extending across the radius of the 
polishing pad 102. The slurry dispense arm 107 dispenses a flow of slurry onto the polishing pad 102. 

It is known that the material removal rate provided by a given polishing pad decreases 
exponentially with time in the manner shown in FIG. 2. As a consequence, the poUshing pad must be 

35 conditioned (e.g., using a conditioning disk 108), between polishing cycles. Doing so roughens the 
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surface of the pad and restores, at least temporarily, its original material removal rate. When the pad 
can no longer be reconditioned, it is replaced. 

A problem with conventional conditioning methods is that they may over condition, e.g., wear 
out, the planarizing surface, and thus may reduce the pad life of the polishing pads. Because of 
5 variation in material removal rates from pad to pad, the CMP tool must be recalibrated to achieve a 
desired material removal rate each time a pad is changed. The production time lost during pad changes 
translates into processing delays and lost efficiency. 

In an attempt to extend the life of the pad, various methods are reported for selectively 
conditioning a polishing pad, and for varying the down force of the conditioning element (e.g., 

10 conditioning disk 108) along the surface of the CMP pad based upon the likely or perceived distribution 
of unacceptable pad conditions across the planarizing surface. Other methods report varying the 
conditioning recipe across the surface of the polishing pad in response to polishing pad non- 
uniformities. However, these reported CMP processes are typically more concerned with improving 
the CMP process, e.g., improving within water non-uniformity, than in extending pad life. 

15 Methods and devices that would extend pad life and therefore reduce the frequency of pad 

replacement offer significant cost savings to the wafer fabrication process. 

Summary of the Invention 
The present invention relates to a method, apparatus and medium for conditioning a planarizing 
surface of a polishing pad in order to extend the working life of the pad. The present invention uses 

20 physical and chemical models (which can be implemented as a single model or multiple models) of the 
pad wear and planarization processes to predict polishing pad performance and to extend pad life. This 
results in an increase in the number of semiconductor wafer or other substrates that can be polished 
with a single polishing pad, thereby providing significant cost savings in the CMP process, both in 
reducing the number of pads needed and the time devoted to pad replacement. 

25 The model predicts polishmg effectiveness (wafer material removal rate) based on the 

^'conditioning"' operating parameters of the conditioning process. In at least some embodiments of the 
present invention, conditioning parameters include pressure (conditioning disk down force) and 
velocity (rotational speed of the conditioning disk), and can also include other factors, such as the 
firequency of conditioning, duration of conditioning and translational speed of conditioning disk across 

30 the pad surface. The model selects, and then maintains, polishing pad conditioning parameters within a 
range that does not overcondition the pad while providing acceptable wafer material removal rates. 
Thus the present invention provides a process for die feedforward and feedbackward control of the 
CMP polishing process. Althougji the invention is described herein with respect to the use of a disk, 
having an abrading of surface thereon, which is pushed against and moved with respect the pad, the 

35 techniques of the invention may be applied to other conditioning mechanisms. 

In one aspect of the invention, a method of conditioning a planarizing surface is provided in a 
chemical mechanical polishing (CMP) apparatus having a polishing pad against which a wafer is 
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positioned for removal of material therefrom and a conditioning disk is positioned for conditioning of 
the polishing pad. The method includes providing a pad wear and conditioning model that defines 
wafer material removal rate as a function of at least one pad conditioning parameters, said at least one 
conditioning parameter having maximum and minimum values, polishing a wafer in the CMP apparatus 
5 under a first set of pad conditioning parameters selected to maintain wafer material removal rates within 
preselected minimum and maximum removal rates, determining a wafer material removal rate 
occurring during said polishing step, calculating updated pad conditioning parameters based upon said 
determined wafer material removal rate of said step (c) and the pad wear and conditioning model to 
maintain wafer material removal rates within the maximum and minimum removal rates, and 

10 conditioning the polishing pad using the updated conditioning parameters. 

In at least some embodiments, the method includes polishing a wafer in the CMP apparatus 
under a first set of pad conditioning parameters selected to maintain wafer material removal rates within 
preselected minimum and maximum removal rates (which conditioning occurs simultaneously with 
polishing in at least some embodiments of the present invention); determining a wafer material removal 

IS rate occurring during the polishing step; calculating, based upon the wafer material removal rate, 

updated pad conditioning parameters to maintain wafer material removal rates within the noiaximum and 
minimum removal rates; and conditioning the polishing pad using the updated pad conditioning 
parameters. In at least some embodiments the polishing step includes polishing of a wafer or it includes 
polishing of two or more wafers, i.e. a polishing cycle. The wafer material removal rates can be 

20 averaged or the last polished wafer noiaterial renK>val rate can be used in updating pad conditioning 
parameters. 

The updated pad conditioning parameters are calculated using a pad wear and conditioning 
model by determining wafer material removal rate as a function of pad conditioning parameters 
including conditioning disk down force and velocity of the conditioning disk; and determining the 

25 difference between the calculated and measured wafer material removal rates and calculating updated 
pad conditioning parameters to reduce said difference, wherein the updated pad conditioning 
parameters are updated according to the equation, ^ = (^] ) + g (A: — (fej )), where /: is a measured 
wafer material removal rate, kj is a calculated wafer material removal rate, g is the estimate gain, and 
{k-( kf)) is the prediction error. 

30 In at least some embodiments, the first set of pad conditioning parameters are determined 

empirically, or using historical data, or using the results of the design of experiment (DOE), a set of 
experiments used to define the model. 

In at least some embodiments, the pad conditioning parameters of the pad wear and 
conditioning model includes frequency of conditioning, or time of conditioning, or the translational 

35 speed (a speed of motion of the disk other than disk rotation) of the conditioning disk during 
conditioning. 
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In at least some embodiments, wafer material removal rate includes measuring the wafer 
thickness before and after polishing. Calculating updated pad conditioning may include executing a 
recursive optimization process. 

In at least some embodiments, the gain, is a value used to indicate the variability or 
reliability in the measured parameter. 

In at least some embodiments, pad life is defined according to the relationship: 



where Frf/,jt is the down force applied by the conditioning disk to the CMP pad during conditioning, o) 
disk is the angular velocity of the conditioning disk during conditioning of the polishing pad, r is the 
duration of conditioning,/is the frequency of condition and T2 is the sweep speed of the conditioning 
disk during conditioning. 

In at least some embodiments, the wafer material removal rate is defined by the equation 



where Fduk is the down force applied by the conditioning disk to the CMP pad during conditioning, a> 
disk is the angular velocity of the conditioning disk during conditioning of the polishing pad, t is the 
duration of conditioning, /is the frequency of condition, and T2 is the sweep speed of the conditioning 
disk during conditioning. In at least some embodiments, the maximum value for wafer material 
removal rate is the saturation point of the wafer material removal rate vs. conditioning down force 
curve i.e., where increases in down force do not affect removal rate. In at least some embodiments, the 
minimum value for wafer material removal rate and hence minimum acceptable conditioning 
parameters is defmed by the maximum acceptable wafer polishing time. 

In at least some embodiments, the wafer material removal rate is determined according to the 
equation: 



where j?/ is the wafer material removal rate for a conditioning parameter X/, pi is the slope and // is the 
intercept of the curve of the defining the relationship between j>/ and Xi where other factors that may 
affect wafer polishing are held constant. 

In at least some embodiments, an updated pad conditioning parameter, is determined by 
solving the equation: 



where >)(+ is the target wafer material removal rate, Wi is the weighing factor for conditioning parameter 
jCf, and Ay is the t)rediction error for wafer material removal rate. 




conditioning 





Pi 
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In at least some aspects of the invention, an apparatus for conditioning polishing pads used to 
planarize substrates by removal of material therefrom includes a carrier assembly having an arm 
positionable over a planarizing surface of a polishing pad; a conditioning disk attached to the carrier 
assembly; and an actuator capable of controlling an operating parameter of the conditioning disk; a 
5 controller operatively coupled to the actuator, the controller operating the actuator to adjust the 

operating parameter of the conditioning disk as a function of a pad wear and conditioning model that 
predicts the wafer material removal rate of the polishing pad based upon polishing pad and wafer 
parameters. The conditioning down force and rotational speed of the conditioning disk is predicted by a 
model by determining wafer material removal rate as a function of pad conditioning parameters 
10 including conditioning disk down force and conditioning disk rotation rate. 

In at least some embodiments, the wafer material removal rate is determined according to the 
equation: 

where j5/ is the wafer material removal rate for a conditioning parameter x,, p/, is the slope and is the 
15 intercept of the curve of the defining the relationship between _pi and x;. 

In at least some embodiments, the updated pad conditioning parameter, is determined by 
solving the equation: 

' Wr 

T^^ » 

Pi 

where jp/+ is the target wafer material removal rate, Wi is the weighing factor for conditioning parameter 

20 jCf, and Ay is the prediction error for wafer material removal rate. 

Thus, polishing pad life is extended by using a more desirable conditioning disk down force 
and angular velocity while keeping within the acceptable range of wafer material removal rate and by 
adjusting the conditioning parameters whenever the removal rate drops below the acceptable removal 
rate. By applying a "one size fits all" approach to pad conditioning parameters (e.g., by determining 

25 conditioning parameters without accounting for a change in actual wafer material removal rates), 
conventional processes overcompensate, thereby removing more pad material than is necessary and 
accelerating pad wear. The invention thus provides more optimal conditioning parameters, i.e., only 
those forces necessary to recondition the damaged pad. 

Brief Description of the Figures 
30 Various objects, features, and advantages of the present invention can be more fully appreciated 

with reference to the following detailed description of the invention when considered in coimection 
with the following drawings. 
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Figures 1 A-B show a conventional CMP machine. Figure 1 A shows a top plan view of a 
conventional CMP noachine. Figure IB shows a side sectional view of the prior art CMP niachine from 
Figure 1 A taken through line A~A. 

Figure 2 shows the exponential decay of wafer material removal rate and the equilibrium state 
5 of the removal rate that occurs between Figure 3B and 3C. 

Figures 3A-C show the chemical reactions that occur between a polishing pad and a polishing 
slurry. Figure 3A generally shows the chemical structure of a polyurethane polishing pad and the ionic 
bonds that form between NCO groups. Figure 3B generally shows how water forms ionic bonds with 
the polyurethane polishing pad by breaking down the ionic bonds between the NCO groups in the 
10 polyurethane composition. Figure 3C generally shows how a silicon slurry forms hydrogen bonds with 
the water and the polyurethane polishing pad. 

Figures 4A-C are cross-sectional diagrams of polishing pads. Figure 4A is a diagram of a new 
polishing pad. Figure 4B is a diagram of an old polishing pad. Figure 4C shows how an old polishing 
pad can be refurbished for continued use. 
15 Rgure 5 is a flow diagram of the feedback loop used in CMP process optimization. 

Figure 6 is a flow diagram illustrating data collection and generation of a pad wear and 
conditioning model. 

Figure 7 is a graph generally showing the wafer material removal rate in view of the pressure 
exerted by the conditiomng disk on the polishing pad. 
20 Figure 8 is a graph generally showing the wafer material removal rate in view of the rotational 

speed exerted by the conditionmg disk on the polishing pad. 

Figure 9 is a model based on Figures 7 and 8 for predicting and modulating the removal rate for 
the next wafer removal. 

Hgure 10 is a side sectional view of a CMP machine for use in the method of at least some 
25 embodiments of the present invention. 

Figure 1 1 is a block diagram of a computer system that includes tool representation and access 
control for use in at least some embodiments of the invention. 

Figure 12 is an illustration of a floppy disk that may store various portions of the software 
according to at least some embodiments of the invention. 

30 Detailed Description of the Invention 

Novel methods for feedforward and feedback controls of the CMP process for maximizing the 
life of the polishing pad are described herein. Extended pad life results in reduced down time for the 
CMP process because the polishing pad can polish more wafers over a longer period of time without 
requiring replacement or adjustment (e.g., removal of the pad). The term wafer is used in a general 

35 sense to include any substantially planar object that is subject to polishing. Wafers include, in 

additional to monolith structures, substrates having one or more layers or thin films deposited thereon. 
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Most CMP pad materials comprise urethane or other polymers, which softens when exposed to 
water. Chemical reactions relating to the pads, shown in FIGs 3A, 3B and 3C, explain the process by 
which softening may occur. In particular, the isocyanate (NCO) groups in the urethane of a brand new 
pad are normally cross-linked through hydrogen bonding, as shown in FIG. 3A. As water from the 
5 polishmg slurry contacts the pad, the water interrupts hydrogen bonding in the cross-linked urethane 
structure, and forms hydrogen bonds with the urethane, as shown in FIG. 3B. When water replaces the 
cross-linked urethane structure, the pad becomes softer. Moreover, the structure in FIG. 3B may react 
with the silica (SiOz) (from material removed from the polishing process) in the slurry to create 
additional hydrogen bonds with the NCO groups in the urethane pad, as shown in FIG. 3C. The pad 

10 becomes "poisoned" as a result of the silica cbenucally reacting with the urethane structure. As water 
evaporates from the slurry, the silica hardens the pad. The hydrogen bonding of the slurry component 
and the pad blocks the mean free path of slurry movement in the pad and decreases the active contact 
areas between the wafers and the pad, so that removal rate of the wafer and surface uniformity 
decreases in the resulting polished wafers. FIG. 2 shows that the removal rate decreases over time in 

15 view of the equilibrium that occurs in the chemical reactions shown in FIGs SB and SC. Once 
equilibrium is reached, the pad poisoning will stop. 

FIGs 4A, 4B and 4C are simplified models showing pad conditioning. As shown in FIG. 4A, 
the height (or depth) 1 of the active pad sites 2 is assumed to be equivalent to the life of the pad 3. As 
the height 1 decreases, the expected further life of the pad 3 decreases. The poisoned areas 4 of the pad 

20 3 in FIG. 4B occur at equilibrium, and are chemically represented by FIGs 7B and 2C. The poisoned 
area 4 is generally physicaUy removed, as shown in. FIG. 4C, by pad conditioning, so that fresh, active 
sites 2 will again be exposed. The process shown in HG.s 4A, 4B and 4C are repeated for the entire 
life cycle of the pad until no more active sites are available. 

The chemical and mechanical processes described above during planarization and conditioning 

25 of the polishing pad provide a model for optimization of the planarization process. By using this 

model, the pad life can be extended without compromise to the wafer material removal rate by adjusting 
the conditioning parameters during wafer polishing. In particular, conditioning disk down forces (F) 
and conditioning disk rotational (or angular) velocity (rpm), and optionally other conditioning 
parameters, for example, conditioning frequency, disk translation speed, and duration of conditioning, 

30 are adjusted during the polishing operation in a feedback and feedforward loop that predicts and then 
optimizes pad conditioning operating parameters. 

According to at least some embodiments of the present invention, an initial model is developed 
based upon knowledge of the wafer polishing process, and is used in at least some embodiments of the 
present invention, as is shown in a flow diagram (FIG. 5). Based on that initial model for a given 

35 wafer polishing recipe, e.g. the wafer and polishing pad parameters remain constant, initial processing 
conditions are identified that will provide a wafer material removal rate between a preselected 
minimum and maximum value for a given set of conditioning parameters, hereinafter, the "acceptable" 
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range for wafer material removal rates. The conditions are selected to prevent overconditioning of the 
pad. In step 500, wafers are polished according to the given wafer polishing recipe using the initial pad 
conditioning paranaieters. The thickness of the polished wafer is measured and a wafer material removal 
rate is calculated in step 510, which information is then used in a feedback loop to maintain the wafer 
5 material removal rate within the accepted range. The actual removal rate is compared with die 

predicted removal rate (derived from the pad wear model). Deviations, i.e., prediction errors, are used 
to calculate pad conditioning parameters in step 520 according to the model of the invention to 
compensate for the reduced polishing capability of the polishing pad as identified in the model and/or to 
correct for any un-modeled effects. The polishing pad is conditioned according to the updated 
10 conditioning parameters in step 530. Polishing is repeated in step 540 and the polishing results are used 
to further update the polishing conditions by repeating steps 510-530. 

By maintaining the wafer material removal rate and conditioning parameters within the preselected 
minimum and maximum range, overconditioning of the pad is prevented, that is, conditioning 
parameters are su£ficient to restore polishing pad effectiveness, but do not unduly damage the pad. In 

IS operation, it may be desirable to select pad conditioning parameters that result in wafer material 
removal rates that are close to the minimum acceptable rates, as these conditioning forces are less 
aggressive and therefore are more likely to avoid overconditioning of the polishing pad. However, one 
should be cautious (or at least cognizant) about operating too closely to the minimum removal rate 
since a sudden degradation in the pad condition may cause the wafer material removal rate to drop 

20 below the minimum acceptable rate. 

As indicated previously, conventional art CMP processes do not change the conditioning down 
force (i.e., the pressure exerted by the conditioning disk on the pad) or the rotational speed uniformly 
across the surface, e.g., from conditioning event to conditioning event, where a single conditioning 
event can be, e.g., the conditioning of the entire polishing pad or a portion of the polishing pad that is m 

25 contact with the wafer during polishing. By applying a "one size fits all" approach to pad conditioning 
parameters, the conventional processes overcompensate, thereby removing more pad material than is 
necessary and accelerating pad wear. The invention thus provides far more optimal conditioning 
parameters. 

Pad conditioning optimization is carried out with reference to a specific polishing system. That 
30 is, the conditions which improve pad lifetime are specific to the type of wafer being polished, the slurry 
used in polishing and the composition of the polishing pad. Once a wafer/slurry/polishing pad system 
is identified, the system is characterized using the models developed and as discussed herein. 
Exemplary polishing pad and wafer parameters include polishing pad size, polishing pad composition, 
slurry composition, wafer composition, rotational velocity of the polishmg pad, polishing pad pressure 
35 and rotational and translational velocity of the wafer. 

In at least some embodiments of the present invention, it is envisioned that a separate model (or 
at least a supplement to a composite model) is created for each slurry/polishing pad wafer combination 
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(i.e., for each different type/brand of slurry and each type/brand of pad that may be used in production 
with a given type of wafer). 

FIG. 6 shows a flow diagram of the steps used in developing the pad wear and conditioning 
model in at least some of the embodiments of the invention. In a first step 600 of the model 
5 development as contemplated by at least some embodiments of the present invention, the relationship 
between wafer material removal rate and a first conditioning parameter xj, e,g,, conditioning disk down 
force (Fdisk)y is determined in the selected polishing system. The relationship is determined by 
measuring wafer material removal rates at different conditioning down forces with wafer parameters 
such as polishing force, polisliing duration, etc., held constant. Thus, a wafer may be polished under 

10 specified conditions, e.g., for a specified time and at specified polishing pad and wafer speeds and the 
rate of material removal may be determined. Pad conditioning and wafer polishing (the "polishing 
event") may be carried out simultaneously, i.e., using an apparatus such as shown in FIG. 10, or pad 
conditioning may be followed by wafer polishing. The conditioning down force is increased 
incrementally from wafer to wafer (or thickness measurement to thickness measurement) with all other 

15 parameters held constant, and the wafer removal rate is again determined. A curve as shown in FIG. 7 
is generated, which illustrates the effect of the conditioning disk down force on the wafer's material 
removal rate for a given polishing system (all other parameters being held constant). 

With reference to FIG. 7, in a first portion of the curve 700, the slope exhibits a linear response 
to a change in down force and is characterized by the angle 0;. The value for Oj is descriptive of the 

20 response of the polishing process to conditioning down force. The larger the value for 6u the steeper 
the slope of the curve and the more sensitive the planarization process is to conditioning down force, hi 
a second region of the curve 720, the curve flattens and becomes substantiaUy non-responsive to 
increases in conditioning down force. This is referred to as the saturation point. The onset of saturation 
is described by the angle 02^ The larger the value for 62, the more gradual the onset of saturation 

25 (poisoning). 

Minimum and maximum values for the model variables are determined in step 610 of FIG. 6. 
The saturation point identifies the maximum (or substantially the maximum) removal rate for this 
polishing system where all other polishing parameters are held constant. It likewise identifies a 
maximum conditioning down force, since additional pressure overconditions the pad and does not 

30 substantially improve polishing rate. A minimum material removal rate is dictated by production goals, 
since a minimal wafer throughput rate is needed. Thus the minimal conditioning down force is also 
defined based on throughput. Once minimum and maximum values for conditioning down force are 
defined, the range is divided into n steps, e.g. n equal steps, which encompass the acceptable working 
range for conditioning down forces. The value for n is selected so that a step in value, e.g., from x to 

35 x+1, is meaningful, for use in updating model parameters in a feedback control algorithm. 

In step 620, as contemplated by at least some embodiments of the present invention, the 
relationship between wafer material removal rate and a second conditioning parameter ;c2, e.g.. 
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conditioning disk rotational velocity, is detennined in the same polishing system in the manner 
described above for conditioning down force. With reference to FIG. 8, a curve can be generated to 
illustrate the effect of the pad rotation velocity on the wafer material removal rate (all other parameters 
held constant). Again, the applied rotation velocity is increased incrementally and the wafer material 
5 removal rate is measured for each polishing event. The region 800 exhibits a linear response to a 
change in pad rotation velocity and is characterized by the angle 63. In region 820, the curve flattens 
and becomes substantially non-responsive to increases in rotational rate. This is referred to as the 
saturation point and is described by the angle 64^ In step 630 of FIG. 6, the maximum wafer material 
removal rate and maximum rotational rate are defined by the saturation point for this polishing system, 
10 where all other polishing parameters are held constant. The minimum rotation rate is determined by the 
production-established minimum wafer material removal rate, e.g., it is based on a through put 
consideration. As above for conditioning down forces, the acceptable range for disk rotational velocity 
may be divided into m steps, e.g. of equal value, for use in updating model parameters in a feedback 
control algorithm. 

15 The models provide maximum and minimum wafer material removal rates, maxunum and 

minimum pad down forces, and maximum and minimum pad rotational rates. In addition, values for 
constants d/ » are determined. Although the above designs of experiment show a conditioning 
parameter that demonstrates an increase in wafer removal rate with increase in magnitude of the 
parameter, it is understood that the opposite relationship can exist, so that the minunal parameter value 

20 produces the maximum wafer removal rate. The models can be adjusted accordingly. Maximum and 
minimum conditions may be determined for any combination of polishing pad, wafer and polishing 
slurry known in the art. Additional pad conditioning parameters, up to xj, may be included in the naodel 
and their minimum and maximum values determined as indicated by steps 640 and 650. 

The model cem be represented as raw data that reflects the system, or it can be represented by 

25 equations, for example multiple input-multiple output linear, quadratic and non-linear equations, which 
describe the relationship among the variables of the system. Feedback and feedforward control 
algorithms can be constracted in step 660 based on the above models using various methods. The 
algorithms can be used to optimize parameters using various methods, such as recursive parameter 
estimation. Recursive parameter estimation is used in situations such as these, where it is desirable to 

30 model on line at the same time as the input-output data is received. Recursive parameter estimation is 
well-suited for making decisions on line, such as adaptive control or adaptive predictions. For more 
details about the algorithms and theories of identification, see Ljung L., System Identification - Tlieory 
for the User, Prentice Hall, Upper Saddle River, N.J. 2nd edition, 1999. 

The wear and reconditioning of the polishing pad may be modeled by eq. 1: 

35 P<'<ii^fe^f[F^.ca<^^t„,^J,T^) (1) 
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where Fduk is the down force applied by the conditioning disk to the polishing pad during conditioning, 
a>a^ is the angular velocity (rotational speed, e.g., rpm) of the conditioning disk during conditioning of 
the polishing pad, t is the conditioning time, and/is frequency of conditioning, and T2 is the sweeping 
speed of the conditioning holder as shown in the example CMP device of FIG. 10 (which will also be 
5 described in greater detail below). The pad may be conditioned in a separate step or while the wafer is 
polished, as is shown in FIG. 10. Frequency is measured as the interval, e.g., number of wafers 
polished, between conditioning events. For example, a frequency of 1 means that the pad is 
conditioned after every wafer, while a frequency of 3 means that the pad is conditioned after every third 
wafer. The sweeping speed is the speed at which the conditioning disk moves across the surface of the 
10 polishing pad. Themotionisindicatedby arrowT2inFIG. 10. For the purposes of initial 

investigation, it is assumed in at least some embodiments of the present invention that t (time), T2 
(sweep speed), and/ (frequency) are held constant. 

In at least some embodiments of the present invention, the wafer material removal rate is 
modeled according to eq. 2: 

15 RcmovalRate]Z = f{F^]Z^0>^lOj]Z>tca.^ionin,AAAAJ2) (2) 

where Faish <Odish f> tcmtditumm&T2, Ou 02, Osf 04 are defined above. TTie objective function is to maintain 
removal rates within the minimum and maximimi allowable rates (the "acceptable rates") by controlling 
the conditioning disk down forces, the rpm of the disk and, optionally, by controlling other factors such 
as frequency and duration of conditioning, and speed of translation of the conditioning disk across the 
20 pad surface, Ta. 

The CMP parameters (variables) and constants from the model may then be programmed into a 
computer, which may then constantiy monitor and appropriately vary the parameters during the process 
to improve the wafer material removal rate and the pad life, as shown in FIG. 9. Parameters from the 
base study 901 are input into the computer or other controller 902, which runs the wafer polishing 

25 process, and the estimator 903, which monitors and modifies the process parameters. The actual output 
(i.e., measured removal rate) 904 is monitored and compared to the predicted output (i.e., target 
removal rate) 905 calculated by estimator 903. The difference 906 between the actual output 904 and 
the predicted output 905 is determined and reported 907 to the estimator 903, which then appropriately 
generates updated parameters 908 for the process 902. Updating model parameters for feedback control 

30 is based on eq. 3. 

k = {k,)+g*{k-{k,)), (3) 

where ^ is a current parameter, ki is previous parameter estimate, g is the estimate gain and (k-( kj)) is 
the prediction error. Estimate gain is a constant selected by the user, which is a measure of machine 
error or variability. Gain factor may be detennined empirically or by usmg statistical methods. 
35 By way of example, a series of curves may be generated for a polishing system of interest as 

described above for determining the relationship between wafer material removal rate and conditioning 
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down force and conditioning disk rotational velocity. Curves are generated using a standard polishing 
procedure, with all polishing pad and wafer conditions held constant with the exception of the 
parameter(s) under investigation. Exemplary polishing pad and wafer parameters that are held constant 
mclude polishing pad size, polishing pad composition, wafer composition, polishing tkne, polishing 
5 force, rotational velocity of the polishing pad, and rotational velocity of the wafer. The parameters 
under investigation include at least the conditioning down force and the angular velocity of the 
conditioning disk. As is shown in greater detail in the analysis that follows, additional parameters may 
be incorporated into the model. Using curves generated as in FIGs. 7 and 8 and model development as 
shown in FIG. 6, values for Or Q4, minimum and maximum values for wafer material removal rate, 
10 conditional down force and conditioning disk rotational velocity are determined. An algorithm that 
models the wafer planarization is defined, and a first set of pad conditioning parameters may be 
determined for the polishing system of interest either empirically, using historical data or from the 
model. 

An algorithm which models the pad wear and pad recovery process is input into the estimator 

IS and a predicted wafer material removal rate is calculated based upon the model. The actual results are 
compared against the predicted results and the error of prediction is fed back into the estimator to refine 
the model. New conditioning parameters are then determined. Based upon the models described 
herein, these parameters are just sufficient to reactivate the pad surface without overconditioning. Thus* 
the smallest increment in conditioning parameters that meet the model criteria is selected for the 

20 updated conditioning parameters. Subsequent evaluation of the updated model will determine how 
good is the fit, and further modifications can be made, if necessary, until the process is optimized. 

In at least some embodiments of the present invention, the conditioning parameters are updated 
in discrete increments or steps, defined by way of example, by the incremental curves shown in FIGs. 7 
and 8. A suitable number of curves are generated so that steps are small enough to permit minor 

25 adjustments to the conditioning parameters. 

Also, in at least some embodiments of the present invention, the updated conditioning 
parameters may be determined by interpolation to the appropriate parameters, which may lie between 
curves. Interpolation may be appropriate in those instances where a fewer number of curves are 
initially generated and the experimental results do not provide a fine resolution of the parameters. 

30 While deviations from the predicted rate reflects, in part, the inabiUty of the model to account 

for all factors contributing to the process (this may be improved with subsequent iterations of the 
feedback process), deviations from the predicted wafer material removal rate over time represent a 
degradation in CMP pad polishing. By identifying and modifying the pad conditioning process to 
account for these temporal changes in polishing performance, optimal wafer material removal rates are 

35 maintained without overconditioning of the condition pads, e.g., by operating below the saturation point 
of the system. 
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An additional feature of the method is the use of gain factor to qualify the prediction error, as 
shown in eq. 3. Thus, the method suggests that the model need not correct for 100% of the deviation 
from predicted value. A gain factor may be used to reflect uncertainty in the measured or calculated 
parameters, or to "damp" the effect of changing parameters too quickly or to a too great an extent. It is 
5 possible, for example, for the model to overcompensate for the prediction error, thereby necessitating 
another adjustment to react to the overcompensation. This leads to an optimization process that is 
jumpy and takes several iterations before the optimized conditions are realized. Use of a gain factor in 
updating the parameters for feedback control qualifies the extent to which the model will react to the 
prediction error. 

10 Once the basic system is understood and optimized, it is possible to empirically vary other 

conditioning operating parameters and access their impact on pad conditioning and wafer polishing. A 
parameter, which had been set to a constant value in the initial study, can be increased (or decreased). 
The system is monitored to determine the effect this change has on the system. It should be readily 
apparent that other factors relevant to pad wear and conditioning may be evaluated in this manner. For 

IS example, conditioning frequency, which may be set to 1 in the initial study, may be increased to 2 
(every second wafer), 3 (every third wafer), etc. The system is monitored to determine where 
degradation starts and the process can be backed off to just before this point. The greater the interval 
between conditioning events, the longer the pad lifetime. Maximizing this interval without loss of 
polishing quality is contemplated as a feature of the method of the invention. 

20 It should be readily apparent that other factors relevant to pad wear and conditioning may be 

evaluated in this manner, either empirically or by mathematical modeling. By way of exanq)le, 
conditioning time (residence time of the disk on the pad), conditionmg disk translational speed, and the 
Uke may be investigated in this manner. 

It is envisioned that at least some embodiments of the present invention may be practiced using 

25 a device 1000 such as the one shown m FIG. 10. The apparatus has a conditioning system 1010 
including a carrier assembly 1020, a conditioning disk 1030 attached to the carrier assembly, and a 
controller 1040 operatively coupled to the carrier assembly to control the down force (F) and rotation 
rate (co) of the conditioning disk. The carrier assembly may have an arm 1050 to which the 
conditioning disk 1030 is attached and means 1060a-d to move the conditioning disk in and out of 

30 contact with the planarizing surface. For example, the controller 1040 may be operatively coupled to 
the moving means to adjust the height and position of the arm carrying the conditioning disk (1060a, 
1060b, 1060c, 1060d). Similar controls for control of the position and movement of the wafer may also 
be present. In operation, the controller adjusts the operatmg parameters of the conditioning disk, e.g., 
down force and rotation rate, in response to changes in wafer material removal rate. The controller may 

35 be computer controlled to automatically provide conditioning according to the calculated conditioning 
recipe. Thus, the apparatus provides a means for selectively varying the pad conditioning parameters 
over the operating life of the pad 1080 in order to extend pad life without compromise to the 
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planarization process of the wafer 1090. Other types of devices where, e.g., other components have 
their height, positions, and/or rotations adjusted are also contemplated by at least some embodiments of 
the present invention. 

Additional apparatus utilized to implement the feedforward and feedback loop include a film 
5 thickness measurement tool to provide thickness data needed to calculate wafer material removal rate. 
The tool may be positioned on the polishing apparatus so as to provide in-line, in situ measurements, or 
it may be located remote fi:om the polishing apparatus. The tool may use optical, electrical, acoustic or 
mechanical measurements methods. A suitable thickness measurement device is available from 
Nanometrics (Milpitas, CA) or Nova Measuring Instruments (Phoenix, AZ). A computer may be 
10 utihzed to calculate the optimal pad conditioning recipe based upon the measured film thickness and 
calculated removal rate, employing the models and algorithm provided according to the invention. A 
suitable integrated controller and poUshing apparatus (Mirra with iAPC or Mirra Mesa with iAPC) is 
available from Applied Materials, California. 

Exemplary semiconductor wafers that can be polished using the concepts discussed herein 
15 mcluding, but are not limited to those made of silicon, tungsten, aluminum, copper, BPSG, USG, 
thermal oxide, silicon-related films, and low k dielectrics and mixtures thereof. 

The invention may be practiced using any number of different types of conventional CMP 
polishing pads. There are numerous polishing pads in the art which are generally made of urethane or 
other polymers. However, any pad which can be reconditioned can be evaluated and optimized using 
20 the invention herein. Exemplary polishing pads mclude Epic™ polishing pads (Cabot Microelectronics 
Corporation, Aurora IL) and Rodel® ICIOOO, IClOlO, IC1400 polishing pads (Rode! Corporation, 
Newark, DE), OXP series polishing pads (Sycamore Pad), Thomas West Pad 711. 813, 815, 815-Ultra, 
817, 826, 828. 828-El (Thomas West). 

Furthermore, any number of different types of slurry can be used m the methods of the 
25 invention. There are numerous CMP polishing slurries in the art, which are generally made to polish 
specific types of metals in semiconductor wafers. Exemplary slurries include Semi-Sperse® (available 
as Semi-Sperse® 12, Semi-Sperse® 25, Semi-Sperse® D7000, Semi-Sperse® D7100, Semi-Sperse® 
D7300, Semi-Sperse® PIOOO, Semi-Sperse® W2000, and Semi-Sperse® W2585) (Cabot 
Microelectronics Corporation, Aurora IL), Rodel ILD1300, Klebesol series, Elexsol , MSW1500, 
30 MSW2000 series, CUS series and PTS (Rodel). 

An example of the algorithm for calculating the conditioning recipe from wafer material 
removal rate data may be defined as: 

yi=PiXi'\'Ii, (4) 

where j)/ is the wafer material removal rate for the conditioning parameter Xh Pi is the slope and /; is the 
35 intercept of the curve of the defining the relationship between j?/ and jc,. Letting x/ = Fjish = oidisK = 
ft X4 = tamditionuig, jcj = T2, the following relationships may be established from the model: 

^Pixj^ h forNi fki {5} 
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92^92X2-^12 forN^#X2^i^k; (6) 

P3^P3X3 + l3 forNt#X3m^k; (7) 

y4=-p4X4'\rl4 for Ni j¥x4 ^Ni^k; (8} 

h ^Psxs + Is forNn, #xs #iV«+fc- {9) 

5 where)) is the predicted removal rate, p is the slope and 7 is the intercept in each equation, N and iV+ 
represent the upper and lower boundary conditions for a particular pad conditioning parameter. Models 
of the mvention may include all or a subset of these pad conditioning parameters. 

Each of the pad conditioning factors contributing to wafer material removal rate may be 
combined in a single equation, which defines the weighted contribution of each factor to the wafer 
10 material renK)val rate. The wafer material removal rate may be defined by eq. 10, 

Wi j?2 + j>i + + VfsH (10) 

where Wi is a weigjiing factor and Wt=^ Wj •¥ W2 -^Ws + W4 W5, The weighing factors are 
determined by minimizing any penalties, e.g., materials defects, nonuniformity of deposition, etc., that 
are associated with Xi for satisfying in eq.l 0. The penalty function may be determined empirically or 
15 by using historical data. 

The prediction error for wafer material removal rate, Ap, is the difference between the predicted 
removal rate, y, and the measured removal rate, y, shown in eq. 1 1 . 

^S>=y-9 (11) 

The prediction error is used to generate an updated wafer material removal rate, The new 
20 predictor based upon the feedback eq. 12 will be: 

and optimized parameter a:/4. is determined by eq. 13. 

^ , (13) 

Pi 

where 3?i+ is the target wafer material removal rate. 
25 The optimized parameters are used to update the new CMP polishing recipe that is sent to the 

tool for use in subsequent polishing steps. Thus, the model is able to adapt as more data is received to 

improve the process without any external control over the process. 

Various aspects of the present invention that can be controlled by a computer, including 

computer or other controller 902, can be (and/or be controlled by) any number of control/computer 
30 entities, including the one shown in FIG. 11. Referring to FIG. 1 1 a bus 1156 serves as the main 

mformation highway interconnecting the other components of system 1111. CPU 1158 is the central 
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processing unit of the system, perfonning calculations and logic operations required to execute the 
processes of embodiments of the present invention as well as other programs. Read only memory 
(ROM) 1160 and random access memory (RAM) 1162 constitute the main memory of the system. Disk 
controller 1164 interfaces one or more disk drives to the system bus 1156. These disk drives are, for 
5 example, floppy disk drives 1170, or CD ROM or DVD (digital video disks) drives 1166, or internal or 
external hard drives 1168. These various disk drives and disk controllers are optional devices. 

A display interface 1172 interfaces display 1148 and permits information i&om the bus 1156 to 
be displayed on display 1148. Display 1148 can be used in displaying a graphical user mterface. 
Communications with external devices such as the other components of the system described above can 

10 occur utilizing, for example, conmmnication port 1174. Optical fibers and/or electrical cables and/or 
conductors and/or optical communication (e.g., infrared, and the like) and/or wireless communication 
(e.g., radio frequency (RF), and the like) can be used as the transport medium betwewi the external 
devices and communication port 1174. Peripheral interface 1154 interfaces the keyboard 1150 and 
mouse 1152, permitting input data to be transmitted to bus 1156. Jn addition to these components, 

15 system 1111 also optionally includes an infrared transmitter and/or infrared receiver. Infrared 

transmitters are optionally utilized when the computer system is used in conjunction with one or more 
of the processing components/stations that transmits/receives data via infrared signal transmission. 
Instead of utilizing an infrared transmitter or infrared receiver, the computer system may also optionally 
use a low power radio transmitter 1180 and/or a low power radio receiver 1182. The low power radio 

20 transmitter transmits the signal for reception by components of the production process, and receives 
signals from the components via the low power radio receiver. The low power radio transmitter and/or 
receiver are standard devices in industry. 

Although system 1111 in FIG. 11 is ilhistrated having a single processor, a single hard disk 
drive and a single local memory, system 1111 is optionally suitably equipped with any multitude or 

25 combination of processors or storage devices. For example, system 1111 may be replaced by, or ^ 
combined with, any suitable processing system operative in accordance with the principles of 
embodiments of the present invention, including sophisticated calculators, and hand-held, 
laptop/notebook, mini, mainframe and super computers, as well as processing system network 
combinations of the same. 

30 FIG. 12 is an illustration of an exemplary computer readable memory medium 1284 utilizable 

for storing computer readable code or instructions. As one example, medium 1284 may be used with 
disk drives illustrated in FIG. 1 1. Typically, memory media such as floppy disks, or a CD ROM, or a 
digital video disk will contain, for example, a multi-byte locale for a single byte language and the 
program information for controlling the above system to enable the computer to perform the functions 

35 described herein. Alternatively, ROM 1160 and/or RAM 1162 illustrated in FIG. 1 1 can also be used to 
store the program information that is used to instruct the central processing unit 1158 to perform the 
operations associated with the instant processes. Other examples of suitable computer readable media 
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for storing information include magnetic, electronic, or optical (including holographic) storage, some 
combination thereof, etc. In addition, at least some embodiments of the present invention contemplate 
that the medium can be m the form of a transmission (e.g., digital or propagated signals). 

In general, it should be emphasized that the various components of embodiments of the present 
invention can be implemented in hardware, software or a combination thereof. In such embodiments, 
the various components and steps would be implemented in hardware and/or software to perform the 
functions of the present invention. Any presently available or future developed computer software 
language and/or hardware components can be employed in such embodiments of the present invention. 
For example, at least some of the functionality mentioned above could be implemented using the C, 
C-H-, or any assembly language appropriate in view of the processor(s) being used. It could also be 
written in an interpretive environment such as Java and transported to multiple destinations to various 
users. 

Although various embodiments which incorporate the teachings of the present invention have 
been shown and described in detail herein, those skilled in the art can readily devise many other varied 
embodiments that incorporate these teachings. 
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What is claimed is: 

1. A method of conditioning a planarizing surface in a chemical mechanical polishing (CMP) 
apparatus having a polishing pad against which a wafer is positioned for removal of material therefrom 
and a conditioning disk is positioned for conditioning of the polishing pad, comprising the steps of: 

a) providing a pad wear and conditioning model that defines wafer material removal rate as a 
function of at least one pad conditioning parameters; said at least one conditioning parameter having 
maximum and minimum values; 

b) polishing a wafer in the CMP apparatus under a first set of pad conditioning parameters 
selected to maintam wafer material removal rates within preselected minimum and maximum removal 
rates; 

c) determining a wafer material removal rate occurring during said polishing step; 

d) calculating updated pad conditioning parameters based upon said determined wafer material 
removal rate of said step (c> and the pad wear and conditioning model to maintain wafer material 
removal rates within the maximum and minimum removal rates; and 

e) conditioning the polishing pad using the updated conditioning parameters. 

2. The method of claim 1, wherein the conditioning parameters comprise conditioning down 

force. 

3. The method of claim 2, wherein the conditioning parameters comprise rotational velocity of 
the conditioning disk. 

4. The method of claim 2, wherein the conditioning parameters comprise one or more 
parameters selected from the group consisting of rotational velocity of the disk, frequency of 
conditioning, duration of conditioning and translational speed of the conditioning disk. 

5. The method of claim 1, wherein the step of calculating updated conditioning parameters 
mcludes calculating parameters such that the parameter is within the determined minimum and 
maximum values. 

6. Hie method of claim 1, wherein the updated pad conditioning parameters are calculated by 
determining the difference between an output of the pad wear and conditioning model and said 
determined wafer material removal of step (c). 
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7. The inethod of claim 6, wherein said difference is adjusted using an estimate gain prior to 
calculating updated conditioning parameters, 

5 8. The method of claim 1, wherein the updated pad conditioning parameters are updated 

according to the equation = (^i ) + g * - (^j )) , where i is a measured wafer material removal rate, 
kjf is a calculated wafer material removal rate, g is the estimate gain, and (k-(k])) is the prediction error. 

9. The method of claim 1, wherein the steps b) through e) are repeated. 

10 

10. The method of claim 1, wherein the first set of pad conditioning parameters are determined 
empirically. 

11. The method of claim 1, wherein the first set of pad conditioning parameters are determined 
1 5 using historical data. 

12. The method of claim 1, wherein the first set of pad conditioning parameters are determined 
using the results of a design of experiment (DOE) used to develop the model. 

20 13. The method of claim 1, wherein the maximum value for wafer material removal rate is the 

saturation point of the wafer material removal rate vs. conditioning down force curve. 

14. The method of claim 1, wherein the minimum value for wafer material removal rate is 
defined by the maximum acceptable wafer polishing time. 

25 

15. The method of claim 1, wherein the step of calculating updated pad conditioning 
parameters in step (d) comprises executing a recursive optimization process. 

16. The method of claim 7, wherein the estimate gain is an indication of variability or 
30 reliability in the measured parameter. 

17. The method of claim 1 , wherein pad life is defined according to the equation: 

PadLife = 

where FtSsk is the down force applied by the conditioning disk to the CMP pad during conditioning, a> 
35 rftf* is the angular velocity of the conditioning disk during conditioning of the polishing pad, / is the time 
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of conditioning, /is the frequency of condition and T2 is the sweeping speed of the conditioning disk 
during conditioning. 

18. The method of claim 1, wherein the wafer material removal rate is defined by the equation 



where F^isk is the down force applied by the conditioning disk to the CMP pad during conditioning, co 
disk is the angular velocity of the conditioning disk during conditioning of the polishing pad, / is the time 
of conditioning,/is the frequency of condition, and T2 is the sweeping speed of the conditioning disk 
1 0 during conditioning. 

19. The method of claim 1, wherein the wafer material removal rate is determined according to 
the equation: 

15 where ^/ is the wafer material removal rate for a conditioning parameter Pi is the slope and Ii is the 
intercept of the curve of the defining the relationship between j'/ and x/. 

20. The method of claim 19, wherein the updated pad conditioning parameter^ JC/+, is 
determined by solving the equation: 



20 



Pi 



where is the target wafer naaterial removal rate, Wi is the weighing factor for conditioning parameter 
Xi, and 4y is the prediction error for wafer material removal rate. 

21 . An apparatus for conditioning polishing pads used to planarize substrates by the removal of 
25 material therefrom, comprising: 

a carrier assembly having an arm positionable over a planarizing surface of a polishing pad; 

a conditioning disk attached to the carrier assembly; 

and an actuator capable of controlling an operating parameter of the conditioning disk; 

a controller operatively coupled to the actuator, the controller operating the actuator to adjust 
30 the operating parameter of the conditioning disk as a function of a pad wear and conditioning model, 
the model comprising: 
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determining wafer material removal rate as a function of pad conditioning parameters including 
conditioning disk down force and conditioning disk rotation rate. 

22. The apparatus of claim 21, wherein the wafer material removal rate is determined 
5 according to the equation: 

where j>/ is the wafer material removal rate for a conditioning parameter jc/, /?/, is the slope and // is the 
intercept of the curve of the defining the relationship between jp/ and jc,. 

10 23. The apparatus of claim 22, wherein the updated pad conditioning parameter, Xi+, is 

determined by solving the equation: 



15 



20 



where is the target wafer material removal rate, Wi is the weighing factor for conditioning parameter 
Xi, and Jy is the prediction error for wafer material removal rate. 

♦ 24. The apparatus of claim 21, wherein the updated pad conditioning parameters are updated 
according to the equation ^ = (fe - 1) + g * (/: - (fc - 1)) , where k isa. measured wafer material removal 
rate, k-l is a calculated wafer material removal rate, g is the estimate gain, and (k-(k'l)) is the 
prediction error. 

25. A method of developing a pad wear and pad conditioning model for optimization of the 
pad conditioning for polishing pads used to remove material from a wafer, comprising the steps of: 

determining the relationship between at least one pad conditioning parameter and wafer 
material removal rate; and 

25 determining maximum and minimum values for each of the at least one pad conditioning 

parameters and the wafer material removal rate; and 

recording the relationships and raininium and maximum values of the at least one pad 
conditioning parameter and the waf^ removal rate. 



30 



26. The method of claim 25, wherein the at least one pad conditioning parameter con:q)rises a 
plurality of parameters and the wafer removal rate is defined as a weighted function of the plurality of 
pad conditioning parameters. 
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27. The method of claim 25, wherein the at least one pad conditioning parameters comprises 
conditioning disk down force. 

5 28. The method of claim 27 wherein the at least one pad conditioning parameter further 

comprises conditioning disk rotational rate. 

29. The method of claim 25, where the at least one pad conditioning parameter comprises one 
or more parameters selected from the group consisting of conditioning disk down force, conditioning 

10 disk rotational rate, frequency of conditioning, and conditioning disk translational speed. 

30. The method of claim 25, wherein the relationship between the at least one conditioning 
parameter and wafer removal mte is detennined by incrementally varying the conditioning parameter 
and measuring the tesultant wafer removal rate. 

15 

31. The method of claim 25, wherein the maximum value for the conditioning parameter is the 
value above which no incremental increase of the wafer removal rate is observed. 

32. The method of claim 25, wherein the minimum value for the conditioning parameter is the 
20 value which provides the minimum wafer removal rate. 

33. The method of claim 25, further comprising the steps of: 

polishing a wafer in the CMP apparatus under a first set of pad conditioning parameters 
selected to maintain wafer material removal rates within preselected minimum and maximum removal 
25 rates; 

determining a wafer material removal rate occurring during said polishing step; 

calculating updated pad conditioning parameters based upon said determined wafer material 
removal rate and the pad wear and conditioning model to maintain wafer material removal rates within 
the maximum and minimum removal rates; and 

30 conditioning the polishing pad using the updated pad conditioning parameters. 

34. The method of claim 33, wherein the updated pad conditioning parameters are calculated 
by determining the difference between an output of the pad wear and conditioning model and said 
determined wafer material removal. 



35 
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35. The method of claim 33, wherein the updated pad conditioning parameters are updated 
according to the equation k = {k- 1)+ g * (fe - (/: - 1)) , where /: is a measured wafer material removal 
rate, is a calculated wafer material removal ikte, g is the estimate gain, and (k-( kj)) is the prediction 
error. 

5 

36. A computer readable medium comprising instructions being executed by a computer, the 
instmctions including a computer-implemented software application for a chemical mechanical 
polishing process, the instructions for implementmg the process comprising the steps of: 

a) receiving data from a chemical mechanical polishing tool relating to the wafer removal rate 
10 of at least one wafer processed in the chemical mechanical polishing process; and 

b) calculating, from the data of step (a), updated pad conditioning parameters within defined 
maximum and minimum values, wherein the updated pad conditioning parameters are calculated by 
determining the difference between an output of a pad wear and conditioning model and the data of step 
(a). 

15 

37. A method of conditioning a planarizing surface in a chemical mechanical polishing (CMP) 
apparatus having a polishing.pad against which a wafer is positioned for removal of material therefrom 
and a conditioning disk is positioned for conditionmg of the polishing pad, comprising the steps of: 

(a) developing a pad wear and pad conditioning model by: 

20 (i) determining the relationship between at least one pad conditioning parameter and 

wafer material removal rate; and 

(ii) determining maximum and minimum values for each of the at least one pad 
conditioning parameters and the wafer material removal rate; and 

(ii) recording the relationships and minimum and maximum values of the at least one 
25 pad conditioning parameter and the wafer removal rate. 

(b) polishing a wafer in the CMP apparatus under a first set of pad conditioning parameters 
selected to maintain wafer material removal rates within preselected minimum and maximum removal 
rates; 

(c) determining a wafer material removal rate occurring during said polishing step; 

30 (d) calculating updated pad conditioning parameters based upon said determined wafer 

material removal rate of said step (b) and the pad wear and conditioning model to maintain wafer 
material removal rates within the n[iaximum and minimum removal rates, 

wherein the wafer material removal rate is determined according to the equation: 
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where j/f is the wafer material removal rate for a conditioning parameter jc/, ph is the slope and // is the 
intercept of the curve of the defining the relationship between pi and Xh and the updated pad 
conditioning parameter, Xi+, is determined by solving the equation: 

X. = i « 

Pi 

where 3>/+ is the target wafer material removal rate, Wi is the weighing factor for conditioning parameter 
Xh and is the prediction error for wafer material removal rate; and 

(e) conditioning the polishing pad using the updated conditioning parameters. 



10 38. A system for conditioning a planarizing surface in a chemical mechanical polishing (CMP) 

apparatus having a polishing pad and a conditioning disk, comprising: 

a) a pad wear and conditioning model that defines wafer material removal rate as a function of 
at least one pad conditioning parameters including rotation and dkection of the conditioning disk; 

b) polishing means for polishing a wafer in the CMP apparatus 

15 c) measuring means for determining a wafer material removal rate; and 

d) calculating means for updating pad conditioning parameters based upon said determined 
wafer material removal rate of said step (c) and the pad wear and conditioning model to maintain wafer 
material removal rates within the maximum and minimum removal rates. 
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